Abstract:

The $k$-monotone classes of densities defined on $(0, \infty)$ have been known
in the mathematical literature but were for the first time considered from a
statistical point of view by Balabdaoui and Wellner (2007) and Balabdaoui
and Wellner (2010). In these works, the authors generalized the results
established for monotone ($k = 1$) and convex ($k = 2$) densities by giving a
characterization of the Maximum Likelihood and Least Squares estimators
(MLE and LSE) and deriving minimax bounds for rates of convergence.
For $k \ge 3$, the pointwise asymptotic behavior of the MLE and LSE studied
by Balabdaoui and Wellner (2007) would show that the MLE and LSE
attain the minimax lower bounds in a local pointwise sense. However, the
theory assumes that a certain conjecture about the approximation error of
a Hermite spline holds true. The main goal of the present note is to show
why such a conjecture cannot be true. We also suggest how to bypass the
conjecture and rebuild the key proofs in the limit theory of the estimators.

Keywords and phrases: conjecture, asymptotic distribution, Hermite
spline, $k$-monotone.



1. Introduction 

For an integer $k \ge 1$, a density $g_0$ defined on $(0,\infty)$ is said to be $k$-monotone
if it is nonincreasing when $k = 1$, and if $(-1)^j g_0^{(j)}$ is nonincreasing and convex
for all $j \in \{0, \ldots, k-2\}$ when $k \ge 2$. Considering the problem of estimating a
density in one of these classes presents several interesting features. As shown in
Balabdaoui and Wellner (2007) and Balabdaoui and Wellner (2010), both the
MLE and LSE of a $k$-monotone density exist. These estimators generalize the
Grenander estimator of a nonincreasing density ($k = 1$) and the MLE and LSE
of a nonincreasing and convex density ($k = 2$) studied by Groeneboom et al.
(2001).

While it is known that the Grenander estimator in the case $k = 1$ converges
pointwise at the rate $n^{1/3}$ and that the MLE of a convex nonincreasing density
converges pointwise at the rate $n^{2/5}$, the rate of convergence $n^{k/(2k+1)}$ for the
MLE (or LSE) of a $k$-monotone density in the general case $k \ge 3$ studied in
Balabdaoui and Wellner (2007) depends on a key conjecture which has not
yet been verified. In fact we show here that the spline conjectures made in
Balabdaoui and Wellner (2007) fail to hold. On the other hand, Gao and Wellner
(2009) obtained a result concerning the global rate of convergence of the MLE
of a $k$-monotone density for a general $k \ge 3$: they showed that the rate of
convergence of the MLE with respect to the Hellinger metric is indeed $n^{k/(2k+1)}$.
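To make the rate arithmetic concrete, the following minimal Python sketch tabulates the exponent $k/(2k+1)$ for a few values of $k$. The function name `pointwise_rate_exponent` is an illustrative choice and does not come from the cited papers; for $k \ge 3$ the pointwise rate is the conjectured one.

```python
from fractions import Fraction


def pointwise_rate_exponent(k: int) -> Fraction:
    """Return a such that the rate discussed in the text is n**a, with a = k/(2k+1)."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    return Fraction(k, 2 * k + 1)


if __name__ == "__main__":
    # k = 1 gives 1/3 (Grenander), k = 2 gives 2/5 (convex case).
    for k in (1, 2, 3, 10):
        print(k, pointwise_rate_exponent(k))
```

The exponent increases with $k$ and stays strictly below $1/2$, the parametric exponent.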

The limit case $k = \infty$ corresponds to the intersection of all $k$-monotone
classes, that is, the class of completely monotone densities on $(0, \infty)$. The latter
turns out to be equal to the class of mixtures of Exponentials, a consequence
of the Bernstein-Widder theorem; see e.g. Cheney and Light (2009). The
nonparametric MLE of a mixture of Exponentials was considered by Jewell (1982), who
showed its consistency and developed an EM algorithm to compute the
estimator. So far, there are no results available on the limit distribution of the
completely monotone MLE. As noted in Balabdaoui and Wellner (2007), one
natural approach is to study the behavior of the MLE in the $k$-monotone
class as $k \to \infty$ and $n \to \infty$. Such an approach evidently requires a deep
understanding of the asymptotic behavior of the $k$-monotone MLE and of the distance
between its knots.



For an arbitrary $k \ge 1$, the work of Balabdaoui and Wellner (2007) aims to
give a general approach to derive the limit distribution of the MLE and LSE
at a fixed point $x_0 > 0$. More precisely, their work can be seen as an extension
of the approach used by Groeneboom et al. (2001) in convex estimation. Let $g_0$
denote the true $k$-monotone density. At $x_0$, and modulo the spline conjecture,
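it is shown that

$$
\begin{pmatrix}
n^{k/(2k+1)} \big( g_n(x_0) - g_0(x_0) \big) \\
n^{(k-1)/(2k+1)} \big( g_n^{(1)}(x_0) - g_0^{(1)}(x_0) \big) \\
\vdots \\
n^{1/(2k+1)} \big( g_n^{(k-1)}(x_0) - g_0^{(k-1)}(x_0) \big)
\end{pmatrix}
\to_d
\begin{pmatrix}
c_0(x_0) H_k^{(k)}(0) \\
c_1(x_0) H_k^{(k+1)}(0) \\
\vdots \\
c_{k-1}(x_0) H_k^{(2k-1)}(0)
\end{pmatrix}
\tag{1}
$$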



where $g_n$ is either the MLE or LSE, and

$$
c_j(x_0) = \left\{ g_0(x_0)^{k-j} \left( \frac{(-1)^k g_0^{(k)}(x_0)}{k!} \right)^{2j+1} \right\}^{1/(2k+1)}
$$



for $j = 0, \ldots, k-1$. Note that the constants $c_j(x_0)$, $j = 0, \ldots, k-1$, also
appear in the asymptotic minimax lower bound for the $L_1$ risk (see Balabdaoui and
Wellner (2010)). Let



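$$
Y_k(t) =
\begin{cases}
\displaystyle \int_0^t \frac{(t-s)^{k-1}}{(k-1)!}\, dW(s) + (-1)^k \frac{k!}{(2k)!}\, t^{2k}, & t \ge 0, \\[2ex]
\displaystyle \int_t^0 \frac{(t-s)^{k-1}}{(k-1)!}\, dW(s) + (-1)^k \frac{k!}{(2k)!}\, t^{2k}, & t < 0,
\end{cases}
$$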



where $W$ is a two-sided Brownian motion on $\mathbb{R}$. The process $H_k$ appearing in
the limit (1) is characterized by the following conditions:

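(i) The process $H_k$ stays everywhere above the process $Y_k$:

$$ H_k(t) \ge Y_k(t), \quad t \in \mathbb{R}. $$

(ii) $(-1)^k H_k$ is $2k$-convex, i.e., $(-1)^k H_k^{(2k-2)}$ exists and is convex.

(iii) The process $H_k$ satisfies

$$ \int_{-\infty}^{\infty} \big( H_k(t) - Y_k(t) \big)\, dH_k^{(2k-1)}(t) = 0. $$

(iv) If $k$ is even, $\lim_{|t| \to \infty} \big( H_k^{(2j)}(t) - Y_k^{(2j)}(t) \big) = 0$ for $j = 0, \ldots, (k-2)/2$; if $k$ is
odd, $\lim_{t \to \infty} \big( H_k(t) - Y_k(t) \big) = 0$ and $\lim_{|t| \to \infty} \big( H_k^{(2j+1)}(t) - Y_k^{(2j+1)}(t) \big) = 0$
for $j = 0, \ldots, (k-3)/2$.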



Because there is so far no device equivalent to the switching relationship
used in the monotone problem (see e.g. Groeneboom (1985) and also
Balabdaoui et al. (2011)), the proof by Groeneboom et al. (2001) of the limit
of the convex estimators is more complex and built in several steps. One of
the most crucial pieces of this proof is the stochastic order $n^{-1/5}$ of the
distance between two knot points of the estimators in a small neighborhood of
$x_0$. This result holds true under the assumption that the true convex density $g_0$
is twice continuously differentiable in a neighborhood of $x_0$ with $g_0''(x_0) > 0$.

In the monotone problem, one can also show that the distance between the
jump points of the Grenander estimator is stochastically bounded above by
$n^{-1/3}$ provided that $g_0$ is continuously differentiable in a neighborhood of $x_0$
with $g_0'(x_0) < 0$. These working assumptions can be naturally put in the
following general form: the true $k$-monotone density is $k$-times continuously
differentiable in a neighborhood of $x_0$ with $(-1)^k g_0^{(k)}(x_0) > 0$. Thus, it
seems natural that $n^{-1/(2k+1)}$ gives the general stochastic order for all integers
$k \ge 1$. As noted in Balabdaoui and Wellner (2007), Mammen and van de Geer
(1997) have, in the context of fitting a regression curve via splines, already
conjectured that $n^{-1/(2k+1)}$ is the order of the distance between the knot points
of their regression spline under the assumption that the true regression curve
satisfies our same working assumptions.



In the extension of the argument of Groeneboom et al. (2001) to an arbitrary
$k$, we have found that there is a need to show that an envelope of a certain
VC-class is bounded. In the next section, we describe this fact more precisely,
and give the connection to our two spline conjectures made in Balabdaoui and
Wellner (2007). In Section 3, we show that these conjectures are false for $k = 3$.
The argument can be generalized to $k \ge 4$ but the calculations rapidly become
cumbersome. In Section 4, we give a number of suggestions for building an
alternative proof for the limit theory of the $k$-monotone estimators.



2. Connection to splines and the conjectures 



We begin with some notation. For integers $m \ge 0$ and $p \ge 1$, let us denote by
$S_m(a_1, a_2, \ldots, a_p)$ the space of splines on $[a, b]$ of degree $m$ and internal knots
$a_1 < \cdots < a_p$. The points $a$ and $b$ can be seen as external knots and will be
denoted by $a_0$ and $a_{p+1}$, respectively. Let $f$ be a differentiable function on $[a, b]$
(differentiable on $(a, b)$ and to the right and left of $a$ and $b$, respectively). If
$m = 2k-1$, $p = 2k-4$, and $a < a_1 < \cdots < a_{2k-4} < b$, we know that there
exists a unique (Hermite) spline $H_k \in S_{2k-1}(a_1, a_2, \ldots, a_{2k-4})$ satisfying

$$ H_k(a_j) = f(a_j) \quad \text{and} \quad H_k'(a_j) = f'(a_j), \quad \text{for } j = 0, \ldots, 2k-3. $$

Note that for $k = 2$ the Hermite spline reduces to the cubic polynomial
interpolating $f$ and $f'$ at $a$ and $b$. We denote by $\mathcal{H}_k$ the spline interpolation operator which
assigns to $f$ its spline interpolant $H_k$.
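As a small illustration of the case $k = 2$, where there are no internal knots, the sketch below builds the cubic Hermite interpolant from the standard Hermite basis on $[a, b]$ and evaluates the interpolation error for $f(x) = x^4/4!$. The helper name `hermite_cubic` is hypothetical and the code is only a sanity check of the definition, not part of the original argument.

```python
def hermite_cubic(f, df, a, b):
    """Return the cubic polynomial matching f and f' at both a and b (case k = 2)."""
    h = b - a
    fa, fb, dfa, dfb = f(a), f(b), df(a), df(b)

    def p(x):
        s = (x - a) / h  # rescale to [0, 1]
        h00 = 2 * s**3 - 3 * s**2 + 1   # value at a
        h10 = s**3 - 2 * s**2 + s       # slope at a
        h01 = -2 * s**3 + 3 * s**2      # value at b
        h11 = s**3 - s**2               # slope at b
        return h00 * fa + h10 * h * dfa + h01 * fb + h11 * h * dfb

    return p


# f has fourth derivative identically 1, so the classical error formula
# gives f(x) - p(x) = x^2 (1 - x)^2 / 24 on [0, 1].
f = lambda x: x**4 / 24
df = lambda x: x**3 / 6
p = hermite_cubic(f, df, 0.0, 1.0)
error_at_half = f(0.5) - p(0.5)
```

Here the error is maximal at $x = 1/2$, where it equals $1/384$; with no internal knots to move, a bound of the type considered in the conjectures below is immediate for $k = 2$. The difficulty for $k \ge 3$ comes entirely from the free internal knots.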

Let $g_n$ be the LSE of the true $k$-monotone density $g_0$ based on $n$ i.i.d. random
variables $X_1, \ldots, X_n$. It was shown by Balabdaoui and Wellner (2010) that $g_n$
exists, is unique, and is a spline of degree $k-1$. Let $H_n$ denote its $k$-fold integral,
that is

$$ H_n(x) = \frac{1}{(k-1)!} \int_0^x (x-t)^{k-1} g_n(t)\, dt. $$



The function $H_n$ is important due to its direct involvement in the
characterization of the estimator $g_n$. More precisely, if we consider the $(k-1)$-fold integral
of the empirical distribution function $\mathbb{G}_n$,

$$ \mathbb{Y}_n(x) = \frac{1}{(k-1)!} \int_0^x (x-t)^{k-1}\, d\mathbb{G}_n(t), $$

then the spline $g_n$ of degree $k-1$ is the LSE if and only if the following (Fenchel)
conditions hold:

$$
\begin{aligned}
H_n(x) &\ge \mathbb{Y}_n(x), \quad \text{for all } x > 0, \\
H_n(x) &= \mathbb{Y}_n(x), \quad \text{if } x \text{ is a knot of } g_n.
\end{aligned}
\tag{2}
$$
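Since the empirical distribution puts mass $1/n$ at each observation, the integral defining the $(k-1)$-fold integral of the empirical distribution function is a finite sum, $\frac{1}{n} \sum_i (x - X_i)_+^{k-1}/(k-1)!$. The sketch below evaluates it directly; the function name `Y_n` and the toy sample are illustrative assumptions, and the code does not compute the LSE itself.

```python
from math import factorial


def Y_n(x, sample, k):
    """(k-1)-fold integral of the empirical distribution function, evaluated at x."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    n = len(sample)
    if k == 1:
        # Zero-fold integral: the empirical distribution function itself.
        return sum(1 for xi in sample if xi <= x) / n
    # For k >= 2 each observation contributes (x - X_i)_+^(k-1) / (k-1)!.
    return sum(max(x - xi, 0.0) ** (k - 1) for xi in sample) / (factorial(k - 1) * n)


sample = [1.0, 2.0, 3.0]
value_k2 = Y_n(2.5, sample, 2)  # (1.5 + 0.5 + 0) / 3
```

A candidate spline could then be checked against the two conditions in (2) by comparing its $k$-fold integral with these values on a grid and at its knots.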



The greater focus put on the LSE is explained by the fact that the
characterization in (2) is much simpler to study, especially when the empirical processes
involved are localized (see Balabdaoui and Wellner (2007)). However, it was
shown by Balabdaoui and Wellner (2007) that understanding the asymptotics
of the LSE is enough, as one can use strong consistency of the MLE to linearize
its characterization and put it in a more familiar form.
One of the key points in the study of the asymptotics is to note that the
characterization of the LSE implies $H_n(\tau) = \mathbb{Y}_n(\tau)$ and $H_n'(\tau) = \mathbb{Y}_n'(\tau)$ for a
knot $\tau$ of $g_n$. Furthermore, given $2k-2$ knots $\tau_0 < \cdots < \tau_{2k-3}$, $g_n$ is uniquely
determined on $[\tau_0, \tau_{2k-3}]$ by the interpolation equalities $H_n^{(i)}(\tau_j) = \mathbb{Y}_n^{(i)}(\tau_j)$,
$i = 0, 1$, $j = 0, \ldots, 2k-3$. In other words, $H_n$ is a Hermite spline interpolant of
$\mathbb{Y}_n$, i.e.,

$$ H_n(x) = \mathcal{H}_k[\mathbb{Y}_n](x) \quad \text{for } x \in [\tau_0, \tau_{2k-3}]. $$

Note that in any small neighborhood of the estimation point $x_0$, strong
consistency of the $(k-1)$-st derivative of $g_n$ combined with the assumption that
$(-1)^k g_0^{(k)}(x_0) > 0$ guarantees that the number of knots in that neighborhood tends
to $\infty$ almost surely as $n \to \infty$. Hence, finding at least $2k-2$ knots is possible
with probability one. At this stage, we know that $\tau_{2k-3} - \tau_0 \to 0$ almost surely
as $n \to \infty$, and our goal is to show that this convergence occurs at the rate
$n^{-1/(2k+1)}$. In the next section, we briefly describe the key argument
in the proof of Balabdaoui and Wellner (2007) and recall the two related spline
conjectures.



2.1. The spline conjectures 

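Take an arbitrary point $\bar\tau \in [\tau_0, \tau_{2k-3}]$ such that $\bar\tau \notin \{\tau_0, \ldots, \tau_{2k-3}\}$. By the
inequality in (2), we have that

$$ \mathcal{H}_k[\mathbb{Y}_n](\bar\tau) \ge \mathbb{Y}_n(\bar\tau). $$

If $Y$ denotes the population counterpart of $\mathbb{Y}_n$, i.e., the $(k-1)$-fold integral of
the distribution function of $g_0$,

$$ Y(x) = \frac{1}{(k-1)!} \int_0^x (x-t)^{k-1} g_0(t)\, dt, $$

then the latter inequality can be rewritten in the more useful form

$$ [\mathcal{H}_k Y - Y](\bar\tau) \ge \mathcal{H}_k[Y - \mathbb{Y}_n](\bar\tau) - [Y - \mathbb{Y}_n](\bar\tau). \tag{3} $$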

Both sides of the inequality can be recognized as the Hermite interpolation errors
corresponding to the interpolated functions $Y$ and $Y - \mathbb{Y}_n$. While $Y$ is $(2k)$-times
differentiable on $[\tau_0, \tau_{2k-3}]$ under our working assumptions, the function
$Y - \mathbb{Y}_n$ is only $(k-2)$-times continuously differentiable since $\mathbb{Y}_n$ is the $(k-1)$-fold
integral of the (piecewise constant) empirical distribution function $\mathbb{G}_n$.

Taylor expansions of $Y$ and $\mathbb{Y}_n - Y$ up to the orders $2k$ and $k-1$, respectively,
give yet another form for (3). On $[\tau_0, \tau_{2k-3}]$, consider the functions

$$ f_0(x) = \frac{x^{2k}}{(2k)!}, \qquad b_u(x) = \frac{(x-u)_+^{k-1}}{(k-1)!}, \quad u \in (\tau_0, \tau_{2k-3}), $$

$$ r(x) = \frac{1}{(2k-1)!} \int_{\bar\tau}^{x} (x-t)^{2k-1} \big( g_0^{(k)}(t) - g_0^{(k)}(\bar\tau) \big)\, dt. $$

Let $e_k = f_0 - \mathcal{H}_k f_0$ be the error associated with Hermite interpolation of $f_0$.
Then, (3) is equivalent to

$$ g_0^{(k)}(\bar\tau)\, e_k(\bar\tau) \le \mathbb{E}_n + \mathbb{R}_n $$

where, with $G_0$ denoting the c.d.f. of $g_0$,

$$ \mathbb{E}_n = \int_{\tau_0}^{\tau_{2k-3}} \mathcal{H}_k[b_u](\bar\tau)\, d\big( \mathbb{G}_n(u) - G_0(u) \big) \quad \text{and} \quad \mathbb{R}_n = \mathcal{H}_k[r](\bar\tau). $$

Recalling that $(-1)^k g_0^{(k)}(x_0) > 0$, so that $(-1)^k g_0^{(k)}$ is positive on a
neighborhood $[x_0 - \delta, x_0 + \delta]$ for some $\delta > 0$, (3) can also be rewritten as

$$ (-1)^k g_0^{(k)}(\bar\tau)\, (-1)^k e_k(\bar\tau) \le \mathbb{E}_n + \mathbb{R}_n. $$

The term $\mathbb{E}_n$ is an empirical process indexed by the class of functions $h$ such
that

$$ h(u) = h_{s, s_0, \ldots, s_{2k-3}}(u) = \mathcal{H}_k[b_u](s)\, 1_{[s_0, s_{2k-3}]}(u), $$

for some $s_0 < \cdots < s_{2k-3}$ in $[x_0 - \delta, x_0 + \delta]$ and $s \in (s_0, s_{2k-3})$. Here $\mathcal{H}_k[f]$
denotes the Hermite spline interpolating $f$ at $s_j$, $j = 0, \ldots, 2k-3$. The second
term $\mathbb{R}_n$ is equal to the interpolation error corresponding to the $(2k)$-times
differentiable function $r$. The main goals are: (a) find upper stochastic bounds for
$\mathbb{E}_n$ and $\mathbb{R}_n$; (b) find a lower bound for $(-1)^k e_k(\bar\tau)$ as a function of a power of
the distance $\tau_{2k-3} - \tau_0$.

In the absence of any knowledge about the location and distribution of the
random knots $\tau_0, \ldots, \tau_{2k-3}$, it seems naturally desirable to get rid of any
dependency on these points. This motivates the assumption that the
interpolation error is uniformly bounded independently of the knots. Thus, the following
conjectures were formulated in Balabdaoui and Wellner (2007) to tackle (a).

Conjecture 1 Let $a = 0$, $b = 1$, and $b_t(x) = (x-t)_+^{k-1}/(k-1)!$ for $t \in (0, 1)$.
There exists a constant $d_k > 0$ such that

$$ \sup_{t \in (0,1)} \ \sup_{0 < y_1 < \cdots < y_{2k-4} < 1} \| b_t - \mathcal{H}_k b_t \|_\infty \le d_k. \tag{4} $$

Conjecture 2 Let $a = 0$ and $b = 1$. Then there exists a constant $c_k > 0$ such
that, for any $f \in C^{(2k)}[0,1]$,

$$ \sup_{0 < y_1 < \cdots < y_{2k-4} < 1} \| f - \mathcal{H}_k f \|_\infty \le c_k \| f^{(2k)} \|_\infty. \tag{5} $$

Note that Conjecture 1 cannot hold if Conjecture 2 does not: indeed, in view
of the Taylor expansion

$$ f(x) = \sum_{j=0}^{2k-1} f^{(j)}(0) \frac{x^j}{j!} + \int_0^1 f^{(2k)}(t) \frac{(x-t)_+^{2k-1}}{(2k-1)!}\, dt $$

and of the fact that polynomials of degree $\le 2k-1$ are preserved by $\mathcal{H}_k$, we
observe that (4) implies (5) with $c_k = d_{2k}$.

Let us fix $s_0$ and $R > 0$ such that $[s_0, s_0 + R] \subset [x_0 - \delta, x_0 + \delta]$. Conjecture 1
implies that the class

$$ \mathcal{F}_{s_0,R} = \big\{ h_{s, s_0, \ldots, s_{2k-3}} : [s_0, s_{2k-3}] \subset [s_0, s_0 + R] \subset [x_0 - \delta, x_0 + \delta] \big\} $$

admits a finite envelope, e.g.

$$ F_{s_0,R}(x) = a_k R^{k-1} 1_{[s_0, s_0+R]}(x) $$

where $a_k > 0$ is a constant depending only on $k$ (through $d_k$). Together with
the fact that the class $\mathcal{F}_{s_0,R}$ is a VC-subgraph class, this gives one of the most
crucial results that help establish the stochastic order of the gap: the "right"
stochastic bound

$$ \mathbb{E}_n = O_p(n^{-2k/(2k+1)}) + o_p\big( (\tau_{2k-3} - \tau_0)^{2k} \big). \tag{6} $$

On the other hand, the term $\mathbb{R}_n$ could be bounded using Conjecture 2. Since $r$
is $(2k)$-times continuously differentiable on a neighborhood of $x_0$, Conjecture 2
yields

$$ \mathbb{R}_n = O_p\big( (\tau_{2k-3} - \tau_0)^{2k} \big). \tag{7} $$

It follows that

$$ \sup_{\bar\tau \in [\tau_{j_0}, \tau_{j_0+1}]} (-1)^k e_k(\bar\tau) \le O_p(n^{-2k/(2k+1)}) + o_p\big( (\tau_{2k-3} - \tau_0)^{2k} \big), $$

where $[\tau_{j_0}, \tau_{j_0+1}]$ is a largest knot interval among $[\tau_j, \tau_{j+1}]$, $j = 0, \ldots, 2k-4$.

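At this stage of the argument, the stochastic order of the gap can be shown
to be $n^{-1/(2k+1)}$ if there exists $M > 0$ such that

$$ \sup_{\bar\tau \in [\tau_{j_0}, \tau_{j_0+1}]} (-1)^k e_k(\bar\tau) \ge M (\tau_{2k-3} - \tau_0)^{2k}. $$

This can be shown using some known results on monosplines and Chebyshev
polynomials (see Balabdaoui and Wellner (2005, 2006, 2007)). Combined with the
upper bound, such a lower bound gives $(M - o_p(1)) (\tau_{2k-3} - \tau_0)^{2k} \le O_p(n^{-2k/(2k+1)})$,
that is, $\tau_{2k-3} - \tau_0 = O_p(n^{-1/(2k+1)})$.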

Conjecturing boundedness of the Hermite spline interpolant was a crucial
assumption to obtain the right stochastic bound for the empirical process $\mathbb{E}_n$ and
the remainder term $\mathbb{R}_n$. However, this boundedness served only as a sufficient
condition. In the next section, we show that Conjecture 2 (hence Conjecture 1)
is in fact answered negatively.



3. Unboundedness of the Hermite interpolation error 

We now prove that the statement of Conjecture 2 (and even a weaker statement
where $c_k$ would be allowed to depend on $f$) is violated for the function $f = S_*$
defined by

$$ S_*(t) = S_*(t; \tau_1, \ldots, \tau_{2k-4}) = \frac{1}{(2k)!} \Big[ t^{2k} + 2 \sum_{i=1}^{2k-4} (-1)^i (t - \tau_i)_+^{2k} \Big]. \tag{8} $$
This choice is dictated by the fact (not necessary here, so not proven) that, for
$0 = \tau_0 < \tau_1 < \cdots < \tau_{2k-4} < \tau_{2k-3} = 1$ and for $t \in [0, 1]$,

$$ \sup_{f \in W_\infty^{2k},\ \| f^{(2k)} \|_\infty \le 1} \big| [\mathcal{H}_k f](t) - f(t) \big| = \big| [\mathcal{H}_k S_*](t) - S_*(t) \big|. \tag{9} $$



Setting $\mathcal{E}_k := \mathcal{H}_k(S_*) - S_*$, the Landau-Kolmogorov inequality (see e.g.
Kolmogorov (1939), Kolmogorov (1962), or Schoenberg (1973)) guarantees the
existence of a constant $D_k > 0$ depending only on $k$ such that

$$ \| \mathcal{E}_k^{(2k-1)} \|_\infty \le D_k\, \| \mathcal{E}_k \|_\infty^{1/(2k)}\, \| \mathcal{E}_k^{(2k)} \|_\infty^{(2k-1)/(2k)}. $$

Since $\mathcal{E}_k^{(2k)} = -S_*^{(2k)}$ alternates between $+1$ and $-1$, so that $\| \mathcal{E}_k^{(2k)} \|_\infty = 1$,
it follows that if Conjecture 2 were true, then $\| \mathcal{E}_k^{(2k-1)} \|_\infty$ would be bounded
independently of the knots. Studying the latter turns out to be easier than
studying $\| \mathcal{E}_k \|_\infty$ itself, as $\mathcal{E}_k^{(2k-1)}$ is a piecewise linear function (not necessarily
continuous at the knots) whose slope alternates between $+1$ and $-1$. Let us note
that $\mathcal{E}_k$ belongs to the space



$$ \Omega_k(\tau_1, \ldots, \tau_{2k-4}) = \big\{ \gamma S_*(t) + s(t),\ \gamma \in \mathbb{R},\ s \in S_{2k-1}(\tau_1, \ldots, \tau_{2k-4}) \big\}, $$

which is a $(4k-3)$-dimensional weak Chebyshev space (see e.g. Lemma 1 in
Bojanov and Naidenov (2002)). Let us also note that $\mathcal{E}_k$ has double zeros
occurring at the knots $\tau_0, \tau_1, \ldots, \tau_{2k-4}, \tau_{2k-3}$. Since $4k-4$ is the maximal number
of zeros for a nonzero function in a weak Chebyshev space of dimension $4k-3$,
there exists a constant $C \in \mathbb{R}$ such that, for all $t \in [0, 1]$, $\mathcal{E}_k(t)$ equals

$$
C \begin{vmatrix}
B_1(0) & B_1'(0) & \cdots & B_1(1) & B_1'(1) & B_1(t) \\
B_2(0) & B_2'(0) & \cdots & B_2(1) & B_2'(1) & B_2(t) \\
\vdots & \vdots & & \vdots & \vdots & \vdots \\
B_{4k-3}(0) & B_{4k-3}'(0) & \cdots & B_{4k-3}(1) & B_{4k-3}'(1) & B_{4k-3}(t)
\end{vmatrix}
$$

where $(B_1, \ldots, B_{4k-3})$ is any basis for $\Omega_k(\tau_1, \ldots, \tau_{2k-4})$. The value of $C$ is
determined by $\mathcal{E}_k^{(2k)}(t) = -1$ for $0 < t < \tau_1$. Our objective is now to prove the
unboundedness of $\| \mathcal{E}_k^{(2k-1)} \|_\infty$, which we do in the particular case $k = 3$. We
consider the basis for $\Omega_3(\tau_1, \tau_2)$ (which has dimension 9) given by

$$ \big( 1,\ t,\ t^2,\ t^2(t-\tau_1),\ t^2(t-\tau_1)^2,\ t^2(t-\tau_1)^2(t-\tau_2),\ (t-\tau_1)_+^5,\ (t-\tau_2)_+^5,\ S_*(t) \big). $$
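The determinantal expression of $\mathcal{E}_3(t)$ can be explicitly written as

$$
\mathcal{E}_3(t) = C \left| \begin{array}{cc|c|c}
1 & 0 & \times \ \cdots \ \times & 1 \\
0 & 1 & \times \ \cdots \ \times & t \\ \hline
0 & 0 & & t^2 \\
0 & 0 & & t^2(t-\tau_1) \\
\vdots & \vdots & D & \vdots \\
0 & 0 & & (t-\tau_2)_+^5 \\
0 & 0 & & S_*(t)
\end{array} \right|
\tag{10}
$$

where $D$ is the $7 \times 6$ matrix

$$
D = \begin{pmatrix}
\tau_1^2 & 2\tau_1 & \tau_2^2 & 2\tau_2 & 1 & 2 \\
0 & \tau_1^2 & \tau_2^2(\tau_2-\tau_1) & \tau_2(3\tau_2-2\tau_1) & 1-\tau_1 & 3-2\tau_1 \\
0 & 0 & \tau_2^2(\tau_2-\tau_1)^2 & p(\tau_1,\tau_2) & (1-\tau_1)^2 & 2(1-\tau_1)(2-\tau_1) \\
0 & 0 & 0 & \tau_2^2(\tau_2-\tau_1)^2 & (1-\tau_1)^2(1-\tau_2) & q(\tau_1,\tau_2) \\
0 & 0 & (\tau_2-\tau_1)^5 & 5(\tau_2-\tau_1)^4 & (1-\tau_1)^5 & 5(1-\tau_1)^4 \\
0 & 0 & 0 & 0 & (1-\tau_2)^5 & 5(1-\tau_2)^4 \\
\dfrac{\tau_1^6}{6!} & \dfrac{\tau_1^5}{5!} & \dfrac{\tau_2^6 - 2(\tau_2-\tau_1)^6}{6!} & \dfrac{\tau_2^5 - 2(\tau_2-\tau_1)^5}{5!} & \dfrac{1 - 2(1-\tau_1)^6 + 2(1-\tau_2)^6}{6!} & \dfrac{1 - 2(1-\tau_1)^5 + 2(1-\tau_2)^5}{5!}
\end{pmatrix}
$$

with

$$ p(\tau_1,\tau_2) = 2\tau_2(\tau_2-\tau_1)(2\tau_2-\tau_1), \qquad q(\tau_1,\tau_2) = (1-\tau_1)\big( 2(1-\tau_2)(2-\tau_1) + 1 - \tau_1 \big). $$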

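Taking the 5th derivative in (10) and expanding along the last column yields,
for $0 < t < \tau_1$,

$$ \mathcal{E}_3^{(5)}(t) = C \big( -120\, \delta_1 + t\, \delta_2 \big), $$

where $\delta_1$ and $\delta_2$ are the determinants of the submatrices of $D$ obtained by
removing the fourth row and the last row, respectively. From $\mathcal{E}_3^{(6)}(t) = -1$ for
$0 < t < \tau_1$, we derive $C = -1/\delta_2$, and in turn

$$ \mathcal{E}_3^{(5)}(0) = 120\, \frac{\delta_1}{\delta_2}. $$

In the case $\tau_2 = 2\tau_1$, an explicit calculation (facilitated by computer algebra
software) reveals that

$$ \delta_1 = \frac{\tau_1^{12}}{360} (1 - 2\tau_1)^6 \big( 4 - 32\tau_1 + 189\tau_1^2 - 312\tau_1^3 + 159\tau_1^4 \big), $$

$$ \delta_2 = 4 \tau_1^{13} (1 - 2\tau_1)^6 (1 - \tau_1)(7 - 5\tau_1). $$

Thus, as $\tau_1 \to 0$, we have

$$ \mathcal{E}_3^{(5)}(0) \sim 120\, \frac{4\tau_1^{12}/360}{28\, \tau_1^{13}} = \frac{1}{21\, \tau_1} \to \infty. $$

This shows that Conjecture 2 does not hold. □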
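The blow-up can be checked independently of the determinant formula by solving the Hermite interpolation problem for $k = 3$ directly in exact rational arithmetic. The sketch below is a sanity check under stated assumptions rather than the computation used in the proof: it writes the quintic spline with $s(0) = s'(0) = 0$ in a truncated power basis, imposes the six remaining conditions at $\tau_1$, $\tau_2$ and $1$, and reads off the fifth derivative at $0^+$. The function names are hypothetical.

```python
from fractions import Fraction
from math import factorial


def solve(A, b):
    """Gauss-Jordan elimination over the rationals (A is assumed nonsingular)."""
    n = len(A)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for c in range(n):
        piv = next(r for r in range(c, n) if M[r][c] != 0)
        M[c], M[piv] = M[piv], M[c]
        M[c] = [v / M[c][c] for v in M[c]]
        for r in range(n):
            if r != c and M[r][c] != 0:
                factor = M[r][c]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[c])]
    return [M[r][n] for r in range(n)]


def pos(x):
    """Positive part of a rational number."""
    return x if x > 0 else Fraction(0)


def fifth_derivative_at_zero(t1, t2):
    """Fifth derivative at 0+ of H_3(S_*) - S_* for knots 0 < t1 < t2 < 1."""
    t1, t2 = Fraction(t1), Fraction(t2)
    # Perfect spline S_* for k = 3 and its first derivative.
    S = lambda t: (t**6 - 2 * pos(t - t1)**6 + 2 * pos(t - t2)**6) / factorial(6)
    dS = lambda t: (t**5 - 2 * pos(t - t1)**5 + 2 * pos(t - t2)**5) / factorial(5)
    # Quintic splines vanishing doubly at 0: (value, derivative) pairs.
    basis = [
        (lambda t: t**2, lambda t: 2 * t),
        (lambda t: t**3, lambda t: 3 * t**2),
        (lambda t: t**4, lambda t: 4 * t**3),
        (lambda t: t**5, lambda t: 5 * t**4),
        (lambda t: pos(t - t1)**5, lambda t: 5 * pos(t - t1)**4),
        (lambda t: pos(t - t2)**5, lambda t: 5 * pos(t - t2)**4),
    ]
    A, b = [], []
    for node in (t1, t2, Fraction(1)):
        A.append([val(node) for val, _ in basis])
        b.append(S(node))
        A.append([der(node) for _, der in basis])
        b.append(dS(node))
    coeffs = solve(A, b)
    # On (0, t1) only the t**5 term contributes, and the 5th derivative of S_* is 0 at 0.
    return 120 * coeffs[3]


if __name__ == "__main__":
    for m in (10, 100, 1000):
        h = Fraction(1, m)
        print(m, float(fifth_derivative_at_zero(h, 2 * h)))
```

With $\tau_2 = 2\tau_1$, the value returned should agree with the ratio $120\, \delta_1/\delta_2 = (4 - 32\tau_1 + 189\tau_1^2 - 312\tau_1^3 + 159\tau_1^4) / (12\, \tau_1 (1-\tau_1)(7-5\tau_1))$ and grow like $1/(21\tau_1)$ as $\tau_1$ decreases.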
4. Alternative arguments

Although the results of Section 3 show that the methods of proof used in
Balabdaoui and Wellner (2007) (which are heavily based on the methods used in Kim
and Pollard (1990) and Groeneboom et al. (2001)) do not suffice for proving
the desired rate results as stated there, we continue to believe that the rate will
be $n^{-1/(2k+1)}$ for the "gap conjecture" of Balabdaoui and Wellner (2007), and
$n^{k/(2k+1)}$ for the MLE of the $k$-monotone density $f_0$. Here we sketch several
possible routes toward proof of these conjectured results.



4.1. Option A: lower bound for the gaps

Note that the arguments in the preceding section showing unboundedness of the
envelope of the interpolation error relied on taking $\tau_2 = 2\tau_1$ so that $\tau_2 - \tau_1 =
\tau_1 \to 0$, where the $\tau$'s are regarded as parameters or variables indexing the entire
class of interpolation errors for a scaling of the problem with

$$ 0 = \tau_0 < \tau_1 < \cdots < \tau_{2k-4} < \tau_{2k-3} = 1. $$

Thus a "coalescence" of the knots leads to failure of the conjectures made in
Balabdaoui and Wellner (2007).



On the other hand, on the original time scale for the (random!) knots $\tau_0 <
\tau_1 < \cdots < \tau_{2k-3}$ we want to show that $\tau_{2k-3} - \tau_0 = O_p(n^{-1/(2k+1)})$. It seems
likely that these random knots for the LSE actually do not "coalesce", but stay
bounded away from each other asymptotically (at the rate $n^{-1/(2k+1)}$), and
hence we expect to have

$$ \max_{1 \le j \le 2k-3} \frac{1}{\tau_j - \tau_{j-1}} = O_p(n^{1/(2k+1)}), \tag{11} $$

or, equivalently

$$ \max_{1 \le j \le 2k-3} \frac{1}{n^{1/(2k+1)} (\tau_j - \tau_{j-1})} = O_p(1). \tag{12} $$



If we could show that (12) holds, then the classes of functions involved in the
interpolation errors could be restricted to classes involving separated knots, and
the conjectures may be more plausible for these restricted classes.



4.2. Option B: alternative inequalities



The methods of proof used in Balabdaoui and Wellner (2007) (and Kim
and Pollard (1990), Groeneboom et al. (2001)) are based on empirical process
inequalities which rely on the smallness or scaling properties of envelopes (see e.g.
Lemma A.1, page 2560 of Balabdaoui and Wellner (2007) or Lemma 4.1 of Kim
and Pollard (1990)), as opposed to smallness of the individual functions in the
class relative to an envelope as in Lemmas 3.4.2 and 3.4.3 of van der Vaart
and Wellner (1996) or van der Vaart and Wellner (2011). On the other hand, the
proofs of the (global) rate of convergence of the Hellinger distance from the MLE $f_n$
to $f_0$ established in Gao and Wellner (2009) rely on inequalities for suprema of
empirical processes based on uniform or bracketing entropy for function classes
in which the $L_2$-norms of individual functions are small relative to envelope
functions (which may possibly be unbounded); see e.g. Lemmas 3.4.2 and 3.4.3
and Theorem 3.4.4 of van der Vaart and Wellner (1996) for bracketing entropy
type bounds, and van der Vaart and Wellner (2011) for classes with well-behaved
uniform entropy bounds. The results of Giné and Koltchinskii (2006)
might also be helpful in connection with the latter classes.

Thus there is some possibility that alternative inequalities for suprema of
the empirical processes involved may be needed in establishing the desired rate
results when $k \ge 3$.



4.3. Option C: alternative inequalities involving "weak parameters"



While the inequalities discussed in Option B above involve application of
empirical process inequalities involving "strong parameters" such as the expected
values of envelope functions, there remains some possibility for the development
of new inequalities based on "weak parameters"; see e.g. the discussion on page
51 of Massart (2007) and the material on page 209 of Boucheron et al. (2013).
This option is the most speculative of the three.